智能论文笔记

SAFL: A Self-Attention Scene Text Recognizer with Focal Loss

Bao Hieu Tran , Thanh Le-Cong , Huu Manh Nguyen , Duc Anh Le , Thanh Hung Nguyen , Phi Le Nguyen

分类：计算机视觉 | 机器学习

2022-01-01

在过去的几十年中，由于其在广泛的应用中，现场文本认可从学术界和实际用户获得了全世界的关注。尽管在光学字符识别方面取得了成就，但由于诸如扭曲或不规则布局等固有问题，现场文本识别仍然具有挑战性。大多数现有方法主要利用基于复发或卷积的神经网络。然而，虽然经常性的神经网络（RNN）通常由于顺序计算而遭受慢的训练速度，并且遇到消失的梯度或瓶颈，但CNN在复杂性和性能之间衡量折衷。在本文中，我们介绍了SAFL，一种基于自我关注的神经网络模型，具有场景文本识别的焦点损失，克服现有方法的限制。使用焦损而不是负值对数似然有助于模型更多地关注低频样本训练。此外，为应对扭曲和不规则文本，我们在传递到识别网络之前，我们利用空间变换（STN）来纠正文本。我们执行实验以比较拟议模型的性能与七个基准。数值结果表明，我们的模型实现了最佳性能。

translated by 谷歌翻译

Federated PCA on Grassmann Manifold for Anomaly Detection in IoT Networks

Tung-Anh Nguyen , Jiayu He , Long Tan Le , Wei Bao , Nguyen H. Tran

分类：机器学习

2022-12-23

In the era of Internet of Things (IoT), network-wide anomaly detection is a crucial part of monitoring IoT networks due to the inherent security vulnerabilities of most IoT devices. Principal Components Analysis (PCA) has been proposed to separate network traffics into two disjoint subspaces corresponding to normal and malicious behaviors for anomaly detection. However, the privacy concerns and limitations of devices' computing resources compromise the practical effectiveness of PCA. We propose a federated PCA-based Grassmannian optimization framework that coordinates IoT devices to aggregate a joint profile of normal network behaviors for anomaly detection. First, we introduce a privacy-preserving federated PCA framework to simultaneously capture the profile of various IoT devices' traffic. Then, we investigate the alternating direction method of multipliers gradient-based learning on the Grassmann manifold to guarantee fast training and the absence of detecting latency using limited computational resources. Empirical results on the NSL-KDD dataset demonstrate that our method outperforms baseline approaches. Finally, we show that the Grassmann manifold algorithm is highly adapted for IoT anomaly detection, which permits drastically reducing the analysis time of the system. To the best of our knowledge, this is the first federated PCA algorithm for anomaly detection meeting the requirements of IoT networks.

translated by 谷歌翻译

BLOOM: A 176B-Parameter Open-Access Multilingual Language Model

Teven Le Scao , Angela Fan , Christopher Akiki , Ellie Pavlick , Suzana Ilić , Daniel Hesslow , Roman Castagné , Alexandra Sasha Luccioni , François Yvon , Matthias Gallé

分类：自然语言处理

2022-11-09

Large language models (LLMs) have been shown to be able to perform new tasks based on a few demonstrations or natural language instructions. While these capabilities have led to widespread adoption, most LLMs are developed by resource-rich organizations and are frequently kept from the public. As a step towards democratizing this powerful technology, we present BLOOM, a 176B-parameter open-access language model designed and built thanks to a collaboration of hundreds of researchers. BLOOM is a decoder-only Transformer language model that was trained on the ROOTS corpus, a dataset comprising hundreds of sources in 46 natural and 13 programming languages (59 in total). We find that BLOOM achieves competitive performance on a wide variety of benchmarks, with stronger results after undergoing multitask prompted finetuning. To facilitate future research and applications using LLMs, we publicly release our models and code under the Responsible AI License.

translated by 谷歌翻译

Online pseudo labeling for polyp segmentation with momentum networks

Toan Pham Van , Linh Bao Doan , Thanh Tung Nguyen , Duc Trung Tran , Quan Van Nguyen , Dinh Viet Sang

分类：计算机视觉

2022-09-29

语义分割是开发医学图像诊断系统的重要任务。但是，构建注释的医疗数据集很昂贵。因此，在这种情况下，半监督方法很重要。在半监督学习中，标签的质量在模型性能中起着至关重要的作用。在这项工作中，我们提出了一种新的伪标签策略，可提高用于培训学生网络的伪标签的质量。我们遵循多阶段的半监督训练方法，该方法在标记的数据集上训练教师模型，然后使用训练有素的老师将伪标签渲染用于学生培训。通过这样做，伪标签将被更新，并且随着培训的进度更加精确。上一个和我们的方法之间的关键区别在于，我们在学生培训过程中更新教师模型。因此，在学生培训过程中，提高了伪标签的质量。我们还提出了一种简单但有效的策略，以使用动量模型来提高伪标签的质量 - 训练过程中原始模型的慢复制版本。通过应用动量模型与学生培训期间的重新渲染伪标签相结合，我们在五个数据集中平均达到了84.1％的骰子分数（即Kvarsir，CVC-ClinicdB，Etis-laribpolypdb，cvc-colondb，cvc-colondb，cvc-colondb和cvc-300）和CVC-300）只有20％的数据集用作标记数据。我们的结果超过了3％的共同实践，甚至在某些数据集中取得了完全监督的结果。我们的源代码和预培训模型可在https://github.com/sun-asterisk-research/online学习SSL上找到

translated by 谷歌翻译

TLETA: Deep Transfer Learning and Integrated Cellular Knowledge for Estimated Time of Arrival Prediction

Hieu Tran , Son Nguyen , I-Ling Yen , Farokh Bastani

分类：机器学习

2022-06-17

车辆到达时间预测已被广泛研究。随着物联网设备和深度学习技术的出现，估计的到达时间（ETA）已成为智能运输系统中的关键组成部分。尽管ETA存在许多工具，但由于特殊车辆的交通数据有限，ETA的特殊车辆（例如救护车，消防车等）仍然具有挑战性。现有作品使用一种模型用于所有类型的车辆，这可能会导致精确度较低。为了解决这个问题，作为该领域的第一个，我们为驾驶时间预测提出了一个深度转移学习框架TLETA。 TLETA构建了细胞时空知识网格，用于提取驾驶模式，并结合道路网络结构嵌入以构建ETA的深神经网络。 Tleta包含可转移的层，以支持不同类别的车辆之间的知识转移。重要的是，我们的转移模型仅训练最后一层以绘制转移的知识，从而大大减少了训练时间。实验研究表明，我们的模型以高精度预测旅行时间，并胜过许多最先进的方法。

translated by 谷歌翻译

Corrupting Data to Remove Deceptive Perturbation: Using Preprocessing Method to Improve System Robustness

Hieu Le , Hans Walker , Dung Tran , Peter Chin

分类：计算机视觉 | 机器学习

2022-01-05

虽然深度神经网络在分类任务方面取得了很大的表现，但最近的研究表明，训练有素的网络可以通过添加微妙的噪音来欺骗。本文介绍了一种新方法，通过将恢复过程应用于自然训练的分类器的顶部来提高神经网络鲁棒性。在这种方法中，图像将被一些重要操作员故意破坏，然后在通过分类器之前恢复。Sargan - 生成对抗网络（GaN）的延伸能够去噪雷达信号。本文将显示Sargan还可以通过去除对抗效应来恢复损坏的图像。我们的结果表明，这种方法确实提高了自然培训的网络的性能。

translated by 谷歌翻译

Predicting Job Titles from Job Descriptions with Multi-label Text Classification

Hieu Trung Tran , Hanh Hong Phuc Vo , Son T. Luu

分类：自然语言处理

2021-12-21

寻找合适的工作和狩猎符合条件的候选人对求职和人力资源机构来说很重要。通过关于职位描述的广泛信息，员工和雇主需要帮助，以根据职位描述文本自动检测职位标题。在本文中，我们提出了用于预测作业描述文本的相关职位标题的多标签分类方法，并实现具有不同预先训练的语言模型的BI-GRU-LSTM-CNN来申请作业标题预测问题。具有多语言预先训练模型的伯特获得了开发和测试集的F1分数的最高结果，该组在开发集中为62.20％，测试集47.44％。

translated by 谷歌翻译

A novel multi-view deep learning approach for BI-RADS and density assessment of mammograms

Huyen T. X. Nguyen , Sam B. Tran , Dung B. Nguyen , Hieu H. Pham , Ha Q. Nguyen

分类：计算机视觉

2021-12-08

高级深度学习（DL）算法可以预测患者基于乳房成像报告和数据系统（BI-RAD）和密度标准的患者发育乳腺癌的风险。最近的研究表明，多视图分析的结合改善了整体乳房考试分类。在本文中，我们提出了一种新的多视图DL方法，用于乳房X线照片的Bi-RAD和密度评估。所提出的方法首先部署深度卷积网络，用于分别对每个视图进行特征提取。然后将提取的特征堆叠并馈入光梯度升压机（LightGBM）分类器中以预测Bi-RAD和密度分数。我们对内部乳房数据集和公共数据集数字数据库进行广泛的实验，用于筛选乳房X线摄影（DDSM）。实验结果表明，所提出的方法在两个基准数据集中突出了巨大的边距（内部数据集5％，DDSM数据集10％）优于两个基准分类方法。这些结果突出了组合多视图信息来改善乳腺癌风险预测性能的重要作用。

translated by 谷歌翻译

Automatically Detecting Cyberbullying Comments on Online Game Forums

Hanh Hong-Phuc Vo , Hieu Trung Tran , Son T. Luu

分类：自然语言处理

2021-06-03

在线游戏论坛对大多数游戏玩家都很受欢迎。他们用它来沟通和讨论游戏的策略，甚至结交朋友。然而，游戏论坛还包含滥用和骚扰演讲，令人不安和威胁的球员。因此，有必要自动检测和删除网络欺凌评论，以保持游戏论坛清洁和友好。我们使用从魔兽世界（WOW）和联盟（LOL）论坛（LOL）论坛和火车分类模型中收集的网络欺凌数据集，以自动检测玩家的评论是否是滥用的。结果获得了LOL论坛的82.69％的宏F1分数，并通过网络伯文数据集的毒性BERT模型为哇论坛的83.86％的宏F1分数。

translated by 谷歌翻译

Invalidator: Automated Patch Correctness Assessment via Semantic and Syntactic Reasoning

Thanh Le-Cong , Duc-Minh Luong , Xuan Bach D. Le , David Lo , Nhat-Hoa Tran , Bui Quang-Huy , Quyet-Thang Huynh

分类：机器学习

2023-01-03

In this paper, we propose a novel technique, namely INVALIDATOR, to automatically assess the correctness of APR-generated patches via semantic and syntactic reasoning. INVALIDATOR reasons about program semantic via program invariants while it also captures program syntax via language semantic learned from large code corpus using the pre-trained language model. Given a buggy program and the developer-patched program, INVALIDATOR infers likely invariants on both programs. Then, INVALIDATOR determines that a APR-generated patch overfits if: (1) it violates correct specifications or (2) maintains errors behaviors of the original buggy program. In case our approach fails to determine an overfitting patch based on invariants, INVALIDATOR utilizes a trained model from labeled patches to assess patch correctness based on program syntax. The benefit of INVALIDATOR is three-fold. First, INVALIDATOR is able to leverage both semantic and syntactic reasoning to enhance its discriminant capability. Second, INVALIDATOR does not require new test cases to be generated but instead only relies on the current test suite and uses invariant inference to generalize the behaviors of a program. Third, INVALIDATOR is fully automated. We have conducted our experiments on a dataset of 885 patches generated on real-world programs in Defects4J. Experiment results show that INVALIDATOR correctly classified 79% overfitting patches, accounting for 23% more overfitting patches being detected by the best baseline. INVALIDATOR also substantially outperforms the best baselines by 14% and 19% in terms of Accuracy and F-Measure, respectively.

translated by 谷歌翻译